SEQUENCE LISTING



<110> CURAGEN CORPORATION
      RASTELLI, LUKA

<120> WNT-REGULATED CYTOKINE-LIKE POLYPEPTIDE AND NUCLEIC
      ACIDS ENCODING SAME




<130> 10716/12

<140> 09/715,418
<141> 2000-11-16

<150> 60/166,177
<151> 1999-11-18

<160> 48

<170> PatentIn Ver. 2.1

<210> 1
<211> 212
<212> DNA
<213> Mus sp.

<400> 1

gaattcagtg atgtagagag ggccattgag acactcatca agaacttcca taaatactct 60 

gtggcgggta aaaaggaaac actgacccct gctgagcttc gagacctggt tacccagcag 120 

ctgccacacc tcatgccgag caactgtggg ttagaagaga aaattgccaa cctgggcaac 180

tgtaatgact cgaaactgga gtttggaagc tt 212 



<210> 2
<211> 1670
<212> DNA
<213> Mus sp.

<220>
<221> modified_base
<222> (1541)
<223> "n" represents a, t, c, g, other or unknown

<400> 2



tcaggtgagc tggctcctcc atcctgtctc ccagctgcca gcaggtctcc ccctcctcta 60

ggtagatcat gatccatcag ctcctgtggg gcaggctata ggacagacga caaaactcaa 120

ctcacagaag gaaggaccag tgtaccagga acgatgggac agtgtcggtc agccaatgct 180

gaggatgccc aagaattcag tgatgtagag agggccattg agacactcat caagaacttc 240

cataaatact ctgtggcggg taaaaaggaa acactgaccc ctgctgagct tcgagacctg 300

gttacccagc agctgccaca cctcatgccg agcaactgtg ggttagaaga gaaaattgcc 360

aacctgggca actgtaatga ctcgaaactg gagtttggaa gcttctggga gttgattgga 420

gaagcagcca agagtgtgaa gatggagagg cctgttactc ggagctgagg acttctactt 480

ggaacttgtt gaggatgttg gggatagggg agttttagag gcactggaaa taaaaccctc 540

aatgccacc acccccttcc ccagcctgca cctctcctca ttgctgcaat gttcacgttc 600

aa-:jacaaact tccctgtggg ctccatggag ctcctgggtc cagaagtcct catctcaagg 660

gatgggttgg ggctggagag gatatgcagg gatcctggaa g'^gtaagggc 720

caagi:aa111 ggtagtaggg gaagggcaga aaggaactgg gttatggaag tgatccaaag 780

agcagggatg ggaatctggc tgcatatttg gtcctgaaaa gggtgtctga gaacctaccc 840

ccttctaatc ttgtcccacc taaactgtag ttgtctgccc tgtgctatcc ttgctgctt'i: 900

':;agctctgcc ccatcctcct tccagtgtct gttcctgagt aggggcaggg gaaataggag 960

i^agagttgca aaagaggctg ^ z:! "t" ciygayyyL-ciu ^ '\~ (^"t" i~ "t" n 'i~ \-.L.u yyyy ^y y y y y 1020

ctagatgctt gggcatttat ggtagttatt ttatatcatt tgattaataa aaatattgga 1080

aaat'fjtaaag aaaaaaaaag aaaaaaacat ggggccgaaa ccttatcccc cttgagtagg 1140

gtgatatttt gcgtgtgcaa tgggcggcct gttttcgaga ggcggtgaca tggggaaaac 1200

atgggggtgt accaaacctt aaccgccttt taggggaaac accccttttg ccgcaagtgg 1260

gttaataacg gaagaagccc gg'::cggattg cccttcacaa gagtctcccg cggtagatgc 1320

^gatgggaca gcccccttcg gc'jgcgttta gagcggcgtg tgtgtggttt ctacgcgaat 1380

agggataaat attgtggcgg cg'i^cgaggga gtgtgtgtgt tgcgcgcctg cttctgtgga 1440

ggtggtgtgt cccaaaaact aaaagggccc ttttgtgcgc gttagtttgc tctagcagag 1500

tccg':;tgcac atattttggt gggcgtgtcc gtgccgcccg nggtggtgct tgttgctggc 1560

gtggcgtggg gtgggtgtgg ttgcgggggt ggtcgtgttg ggtgtgtgcg tgcgcgcggg 1620

ggccgtgtgt gtgtgtggtt gcatgataag gttagagtga gtgagagcgg 1670



<210> 3
<211> 131
<212> PRT
<213> Mus sp.

<400> 3

Ser Ile Ser Ser Cys Gly Ala Gly Tyr Arg Thr Asp Asp Lys Thr Gln
  1               5                  10                  15

Leu Thr Glu Gly Arg Thr Ser Val Pro Gly Thr Met Gly Gln Cys Arg
             20                  25                  30

Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser Asp Val Glu Arg Ala
         35                  40                  45

Ile Glu Thr Leu Ile Lys Asn Phe His Lys Tyr Ser Val Ala Gly Lys
     50                  55                  60

Lys Glu Thr Leu Thr Pro Ala Glu Leu Arg Asp Leu Val Thr Gln Gln
 65                  70                  75                  80

Leu Pro His Leu Met Pro Ser Asn Cys Gly Leu Glu Glu Lys Ile Ala
                 85                  90                  95

Asn Leu Gly Asn Cys Asn Asp Ser Lys Leu Glu Phe Gly Ser Phe Trp
            100                 105                 110

Glu Leu Ile Gly Glu Ala Ala Lys Ser Val Lys Met Glu Arg Pro Val
        115                 120                 125

Thr Arg Ser
    130



<210> 4
<211> 357
<212> DNA
<213> Homo sapiens

<220>
<221> modified_base
<222> (231)
<223> "n" represents a, t, c, g, other or unknown

<220>
<221> modified_base
<222> (337)
<223> "n" represents a, t, c, g, other or unknown

<400> 4

ataggacaac agaactctca ccaaaggacc agacacagtg agcaccatgg gacagtgtcg 60 

gtcagccaac gcagaggatg ctcaggaatt cagtgatgtg gagagggcca ttgagaccct 120 

catcaagaac tttcaccagt actccgtgga gggtgggaag gagacgctga ccccttctga 180

gctacgggac ctggtcaccc agcagctgcc ccatctcatg ccgagcaact ntggcctgga 240

agagaaaatt gccaacctgg gcagctgcaa tgactctaaa ctggagttca ggagtttctg 300

ggagctgatt ggagaagcgg ccaagagtgt gaagctngag aggactgtcc gggggca 357



<210> 5
<211> 379
<212> DNA
<213> Homo sapiens

<400> 5

gaattccaga gggagttctc agtgcccccg gacaggcctc tccagcttca cactcttggc 60

cgcttctcca atcagctccc agaaactcct gaactccagt ttagagtcat tgcagctgcc 120

caggttggca attttctctt ccaggccaca gttgctcggc atgagatggg gcagctgctg 180

ggtgaccagg tcccgtagct cagaaggggt cagcgtctcc ttcccaccct ccacggagta 240

ctggtgaaag ttcttgatga gggtctcaat ggccctctcc acatcactga attcctgagc 300

atcctctgcg ttggctgacc gacactgtcc catggtgctc actgtgtctg gtcctttggt 360

gagagttctg ttgtcctat 379



<210> 6
<211> 118
<212> PRT
<213> Homo sapiens

<400> 6

Asp Asn Arg Thr Leu Thr Lys Gly Pro Asp Thr Val Ser Thr Met Gly
  1               5                  10                  15

Gln Cys Arg Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser Asp Val
             20                  25                  30

Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr Ser Val
         35                  40                  45

Glu Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu Val
     50                  55                  60

Thr Gln Gln Leu Pro His Leu Met Pro Ser Asn Cys Gly Leu Glu Glu
 65                  70                  75                  80

Lys Ile Ala Asn Leu Gly Ser Cys Asn Asp Ser Lys Leu Glu Phe Arg
                 85                  90                  95

Ser Phe Trp Glu Leu Ile Gly Glu Ala Ala Lys Ser Val Lys Leu Glu
            100                 105                 110

Arg Pro Val Arg Gly His
        115



<210> 7
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Primer

<400> 7

cttgatgagg gtctcaatgg 20 



<210> 8
<211> 26
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Probe

<400> 8

ccacatcact gaattcctga gcatcc 26



<210> 9
<211> 20
<212> DNA
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Primer

<400> 9

cagacacagt gagcaccatg 20



<210> 10
<211> 98
<212> PRT
<213> Homo sapiens

<400> 10

Met Ala Ala Glu Pro Leu Thr Glu Leu Glu Glu Ser Ile Glu Thr Val
  1               5                  10                  15

Val Thr Thr Phe Phe Thr Phe Ala Arg Gln Glu Gly Arg Lys Asp Ser
             20                  25                  30

Leu Ser Val Asn Glu Phe Lys Glu Leu Val Thr Gln Gln Leu Pro His
         35                  40                  45

Leu Leu Lys Asp Val Gly Ser Leu Asp Glu Lys Met Lys Ser Leu Asp
     50                  55                  60

Val Asn Gln Asp Ser Glu Leu Lys Phe Asn Glu Tyr Trp Arg Leu Ile
 65                  70                  75                  80

Gly Glu Leu Ala Lys Glu Ile Arg Lys Lys Lys Asp Leu Lys Ile Arg
                 85                  90                  95

Lys Lys



<210> 11
<211> 110
<212> PRT
<213> Homo sapiens

<400> 11

Met Ser Gln Leu Glu Arg Asn Ile Glu Thr Ile Ile Asn Thr Phe His
  1               5                  10                  15

Gln Tyr Ser Val Lys Leu Gly His Pro Asp Thr Leu Asn Gln Gly Glu
             20                  25                  30

Phe Lys Glu Leu Val Arg Lys Asp Leu Gln Asn Phe Leu Lys Lys Glu
         35                  40                  45

Asn Lys Asn Glu Lys Val Ile Glu His Ile Met Glu Asp Leu Asp Thr
     50                  55                  60

Asn Ala Asp Lys Gln Leu Ser Phe Glu Glu Phe Ile Met Leu Met Ala
 65                  70                  75                  80

Arg Leu Thr Trp Ala Ser His Glu Lys Met His Glu Gly Asp Glu Gly
                 85                  90                  95

Pro Gly His His His Lys Pro Gly Leu Gly Glu Gly Thr Pro
            100                 105                 110



<210> 12
<211> 37
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: S-100/ICaBP type
      calcium binding protein

<400> 12

Ser Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser Cys Asn
  1               5                  10                  15

Asp Ser Lys Leu Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu Ala
             20                  25                  30

Ala Lys Ser Val Lys
         35



<210> 13
<211> 37
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: S-100/ICaBP type
      calcium binding protein

<400> 13

Asp Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr
  1               5                  10                  15

Ser Val Glu Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp
             20                  25                  30

Leu Val Thr Gln Gln
         35



<210> 14
<211> 19
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Bacterial type II
      secretion system protein F

<400> 14

Val Thr Gln Gln Leu Pro His Leu Met Pro Ser Asn Cys Gly Leu Glu
  1               5                  10                  15

Glu Lys Ile



<210> 15
<211> 10
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Ubiquitin
      carboxyl-terminal hydrolases family

<400> 15

Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser
  1               5                  10



<210> 16
<211> 49
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Bacterial
      chemotaxis sensory transducers protein

<400> 16

Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu Val Thr
  1               5                  10                  15

Gln Gln Leu Pro His Leu Met Pro Ser Asn Cys Gly Leu Glu Glu Lys
             20                  25                  30

Ile Ala Asn Leu Gly Ser Cys Asn Asp Ser Lys Leu Glu Phe Arg Ser
         35                  40                  45

Phe



<210> 17
<211> 32
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      Phosphoenolpyruvate carboxykinase (ATP) protein

<400> 17

Met Gly Gln Cys Arg Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser
  1               5                  10                  15

Asp Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gly Tyr
             20                  25                  30



<210> 18
<211> 33
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Prokaryotic-type
      carbonic anhydrases proteins

<400> 18

His Gln Tyr Ser Val Glu Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu
  1               5                  10                  15

Leu Arg Asp Leu Val Thr Gln Gln Leu Pro His Leu Met Pro Ser Asn
             20                  25                  30



<210> 19
<211> 15
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Ergosterol
      biosynthesis ERG4/ERG24 family protein

<400> 19

Ile Ala Asn Leu Gly Ser Cys Asn Asp Ser Lys Leu Glu Phe Arg
  1               5                  10                  15



<210> 20
<211> 25
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      Lysosome-associated membrane glycoproteins du

<400> 20

Leu Met Pro Ser Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly
  1               5                  10                  15

Ser Cys Asn Asp Ser Lys Ile Glu Phe
             20                  25



<210> 21
<211> 39
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      Phosphofructokinase proteins

<400> 21

Pro Ser Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser Cys
  1               5                  10                  15

Asn Asp Ser Lys Leu Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu
             20                  25                  30

Ala Ala Lys Ser Val Lys Leu
         35



<210> 22
<211> 10
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: PH domain
      proteins profile

<400> 22

Pro Ser Asn Cys Gly Leu Glu Glu Lys Ile
  1               5                  10



<210> 23
<211> 45
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Myotoxins
      protein

<400> 23

Gln Gln Leu Pro His Leu Met Pro Ser Asn Cys Gly Leu Glu Glu Lys
  1               5                  10                  15

Ile Ala Asn Leu Gly Ser Cys Asn Asp Ser Lys Leu Glu Phe Arg Ser
             20                  25                  30

Phe Trp Glu Leu Ile Gly Glu Ala Ala Lys Ser Val Lys
         35                  40                  45



<210> 24
<211> 17
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      Phosphatidylinositol-specific phospholipase X

<400> 24

Met Pro Ser Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser
  1               5                  10                  15

Cys



<210> 25
<211> 45
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Glypicans
      protein

<400> 25

Met Gly Gly Cys Arg Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser
  1               5                  10                  15

Asp Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr
             20                  25                  30

Ser Val Glu Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu
         35                  40                  45



<210> 26
<211> 10
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Membrane attack
      complex components/perforin

<400> 26

Ile Lys Asn Phe His Gln Tyr Ser Val Glu
  1               5                  10



<210> 27
<211> 44
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Urease nickel
      ligands protein

<400> 27

Thr Leu Ile Lys Asn Phe His Gln Tyr Ser Val Glu Gly Gly Lys Glu
  1               5                  10                  15

Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu Val Thr Gln Gln Leu Pro
             20                  25                  30

His Leu Met Pro Ser Asn Cys Gly Leu Glu Glu Lys
         35                  40



<210> 28
<211> 13
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Phosphoglycerate
      mutase family phosphohistidi

<400> 28

Gln Glu Phe Ser Asp Val Glu Arg Ala Ile Glu Thr Leu
  1               5                  10



<210> 29
<211> 10
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Ribosomal protein
      L23 protein

<400> 29

Glu Leu Arg Asp Leu Val Thr Gln Gln Leu
  1               5                  10



<210> 30
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      2'-5'-oligoadenylate synthetases protein

<400> 30

Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser Cys Asn Asp Ser Lys Leu
  1               5                  10                  15

Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu Ala Ala Lys Ser Val
             20                  25                  30

Lys Leu Glu Arg Pro Val Arg Gly His
         35                  40



<210> 31
<211> 24
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Formate and
      nitrite transporters protein

<400> 31

Asp Leu Val Thr Gln Gln Leu Pro His Leu Met Pro Ser Asn Cys Gly
  1               5                  10                  15

Leu Glu Glu Lys Ile Ala Asn Leu
             20



<210> 32
<211> 34
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Glycoprotein
      hormones beta chain protein

<400> 32

Gln Gln Cys Arg Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser Asp
  1               5                  10                  15

Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr Ser
             20                  25                  30

Val Glu



<210> 33
<211> 21
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Vinculin family
      talin-binding region protein

<400> 33

Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu Val Thr Gln Gln
  1               5                  10                  15

Leu Pro His Leu Met
             20



<210> 34
<211> 40
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Beta-lactamases
      class B protein

<400> 34

Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr Ser Val Glu Gly Gly
  1               5                  10                  15

Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu Val Thr Gln Gln
             20                  25                  30

Leu Pro His Leu Met Pro Ser Asn
         35                  40



<210> 35
<211> 21
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Heat shock hsp20
      protein family profile

<400> 35

Leu Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu Ala Ala Lys Ser
  1               5                  10                  15

Val Lys Leu Glu Arg
             20



<210> 36
<211> 35
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism:
      Hydroxymethylglutaryl-coenzyme A lyase protein

<400> 36

Met Gly Gly Cys Arg Ser Ala Asn Ala Glu Asp Ala Gln Glu Phe Ser
  1               5                  10                  15

Asp Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr
             20                  25                  30

Ser Val Glu
         35



<210> 37
<211> 294
<212> DNA
<213> Mus sp.

<400> 37

gaattccaga gggagttctc agtgcccccg gacaggcctc tccagcttca cactcttggc 60

cgcttctcca atcagctccc agaaactcct gaactccagt ttagagtcat tgcagctgcc 120

caggttggca attttctctt ccaggccaca gttgctcggc atgagatggg gcagctgctg 180

ggtgaccagg tcccgtagct cagaaggggt cagcgtctcc ttcccaccct ccacggagta 240

ctggtgaaag ttcttgatga gggtctcaat ggccctctcc acatcactga attc 294

<210> 38
<211> 43
<212> PRT
<213> Homo sapiens

<400> 38

Val Glu Arg Ala Ile Glu Thr Leu Ile Lys Asn Phe His Gln Tyr Ser
  1               5                  10                  15

Val Glu Gly Gly Lys Glu Thr Leu Thr Pro Ser Glu Leu Arg Asp Leu
             20                  25                  30

Val Thr Gln Gln Leu Pro His Leu Met Pro Ser
         35                  40

<210> 39
<211> 44
<212> PRT
<213> Homo sapiens

<400> 39

Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser Cys Asn Asp
  1               5                  10                  15

Ser Lys Leu Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu Ala Ala
             20                  25                  30

Lys Ser Val Lys Leu Glu Arg Pro Val Arg Gly His
         35                  40

<210> 40
<211> 44
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus
      sequence

<400> 40

Asn Cys Gly Leu Glu Glu Lys Ile Ala Asn Leu Gly Ser Cys Asn Asp
  1               5                  10                  15

Ser Lys Leu Glu Phe Arg Ser Phe Trp Glu Leu Ile Gly Glu Ala Ala
             20                  25                  30

Lys Ser Val Lys Leu Glu Arg Pro Val Arg Gly His
         35                  40



<210> 41
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: gi/4139958/pdb/1MHO

<400> 41

Glu Lys Ala Val Val Ala Leu Ile Asp Val Phe His Gln Tyr Ser Gly
  1               5                  10                  15

Arg Glu Gly Asp Lys His Lys Leu Lys Lys Ser Glu Leu Lys Glu Leu
             20                  25                  30

Ile Asn Asn Glu Leu Ser His Phe Leu
         35                  40



<210> 42
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Protein MRP-126

<400> 42

Glu Lys Ala Ile Asp Val Ile Ile Asp Val Phe His Gln Tyr Ser Arg
  1               5                  10                  15

Arg Glu Gly Asp Lys Asp Thr Leu Thr Arg Lys Glu Leu Lys Leu Leu
             20                  25                  30

Ile Glu Lys Gln Leu Ala Asn Tyr Leu
         35                  40



<210> 43
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: ICTACALCIN

<400> 43

Gln Lys Gly Met Ala Leu Leu Ile Ser Thr Phe His Lys Tyr Ser Gly
  1               5                  10                  15

Lys Glu Gly Asp Lys Cys Thr Leu Thr Lys Gly Glu Leu Lys Asp Leu
             20                  25                  30

Ile Thr Lys Glu Leu Gly Gly Ala Phe
         35                  40



<210> 44
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: CALGRANULIN B

<400> 44

Glu Ser Ser Ile Glu Thr Ile Ile Asn Ile Phe His Gln Tyr Ser Val
  1               5                  10                  15

Arg Leu Gly His Tyr Asp Thr Leu Ile Gln Lys Glu Phe Lys Gln Leu
             20                  25                  30

Val Gln Lys Glu Leu Pro Asn Phe Leu
         35                  40



<210> 45
<211> 10
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus
      sequence

<400> 45

Ile Phe His Tyr Ser Gly Leu Glu Leu Leu
  1               5                  10



<210> 46
<211> 41
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: CALGRANULIN B

<400> 46

Glu Arg Ser Ile Thr Thr Ile Ile Asp Thr Phe His Gln Tyr Ser Arg
  1               5                  10                  15

Lys Glu Gly His Pro Asp Thr Leu Ser Lys Lys Glu Phe Arg Gln Met
             20                  25                  30

Val Glu Ala Gln Leu Ala Thr Phe Met
         35                  40



<210> 47
<211> 11
<212> PRT
<213> Artificial Sequence

<220>
<223> Description of Artificial Sequence: Consensus
      sequence

<400> 47

Glu Ile Phe His Gln Tyr Ser Gly Leu Glu Leu
  1               5                  10



<210> 48
<211> 44
<212> PRT
<213> Unknown Organism

<220>
<223> Description of Unknown Organism: Alignment
      sequence

<400> 48

Leu Glu Lys Ala Ile Glu Thr Ile Ile Asn Val Phe His Gln Tyr Ser
  1               5                  10                  15

Gly Arg Glu Gly Asp Lys Asp Thr Leu Ser Lys Lys Glu Leu Lys Glu
             20                  25                  30

Leu Leu Glu Lys Glu Leu Pro Asn Phe Leu Lys Asn
         35                  40



